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1. Introduction 

The goal of the present paper is to investigate the concentration of measure phenomenon 
for norms of infinitely divisible random vectors with independent coordinates. Let X ~ 
ID(j,0, v) be an infinitely divisible (ID) vector without Gaussian component in R d , and 
with characteristic function 

^)=Ec i < i " Y >=exp(i(t, 7 )+ / (c^-l-i^liHi^Mdw)), (1.1) 



where t, 7 € R d and where v ^ (the Levy measure) is a positive Borel measure on R d , 
without atom at the origin and such that J Rd (l A ||M|||)^(dw) < +00 (throughout, (•,■) 
denotes the Euclidean inner product in R d , while || • H2 is the corresponding Euclidean 
norm). Properties of X can be read from properties of v. For example, X has independent 
components if and only if v is supported on the axes of K d , that is, 

d 

vldxi, . . .,Axd) = y^<5p(da;i) • • ■ 5 (dxk-i)i>k(dx k )5o(dxk+i) ■ ■ -5o(dxd) (1.2) 
fe=i 
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for some one-dimensional Levy measures &k, the i.i.d. case corresponding to Vk = v, for 
k = 1, . . . , d. For simplicity of notation, we shall often assume in the sequel that X has 
i.i.d. components rather than merely independent ones. It is an easy matter, left to the 
reader, to transform any i.i.d. result obtained below into an "independent" one; often it 
just involves introducing miiik = i,... t d or Taaxk=i,....d in the notation. 

In a seminal work, Talagrand [13] proved a concentration inequality for the product 
of (one-sided) exponential measures (see also Maurey [9]). This inequality was the first 
to mix two different norms (I 1 and £ 2 ), improving upon some aspects of Gaussian con- 
centration. Rewriting it, in functional form, it asserts that if X is a random vector in 
M. d with i.i.d. exponential components and if / is a real-valued Lipschitz function on M. d 
such that 

3a, p > 0,Vs, yeM. d \f(x) - f(y)\ < mm.(a\\x - yh,P\\x - y\\i), 
then there exists a universal constant K > such that 

P(f{X) - m(f(Xj) >u)< cxp^-Jf min 

where m(f(X)) is a median of f(X). What is remarkable here is the dimension-free 
nature of this concentration inequality. For instance, applying it to the Euclidean norm, 
we note that a = (3 = 1 and that the only dependency in the dimension d is through 
the median itself. This result of Talagrand, which clearly continues to hold for Lipschitz 
images of the exponential measure, is actually true for any law satisfying a Poincare 
inequality (see Bobkov and Ledoux [2]). 

We would here like to obtain dimension-free concentration for infinitely divisible vectors 
with finite exponential moments, of which the exponential measure is a particular case. 
The need to have finite exponential moments to obtain dimension-free results is clear, in 
view of Proposition 5.1 of [13] (see also [1]). On the other hand, by a result of Borovkov 
and Utev [3], one-dimensional laws satisfying a Poincare inequality must have a non- 
trivial absolutely continuous component, thus making our results non- vacuous. The class 
of infinitely divisible laws is quite encompassing and, for example, on R + , any log-convex 
density is infinitely divisible. The situation is more subtle as far as one-dimensional log- 
concave measures (which necessarily satisfy a Poincare inequality) is concerned and, for 
instance, the (infinitely divisible) gamma law with parameters a > and t > 0, and with 
density a t x t ~ 1 e~ ax /T(t), x > 0, is log-concave if and only if t > 1. Let us also mention 
that double Wiener-Ito integrals form another important example of infinitely divisible 
laws (we refer the reader to Sato [12] for a comprehensive introduction to infinitely 
divisible laws). 

For general Lipschitz functions and general ID vectors, generic results have already 
been obtained, but when specialized to vectors with i.i.d. components, they are not 
always dimension- free (we refer to [6] for more precise statements). In fact, it is not clear 
whether or not an extra assumption, such as convexity, might be needed in order to 
obtain dimension-free concentration for generic Lipschitz functions. As shown below, for 
•P-norms, p > 2, we do obtain dimension-free concentration. 

Let us state a first result for the Euclidean norm. 
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Theorem 1. Let X = (X\, . . . , Xj) be an ID vector with i.i.d. coordinates, characteristic 
function (1.1) and Levy measure as in (1.2). Let Ee*" x " 2 < +oo for some t > 0, let 
AI = sup{i > 0:Ee t|Xl1 < +00} and also let I = - logE[e" x i ] . Then, for all x > 0, 

F(\\X\\ 2 >E||X|| 2 +.t) < e -™Po< t <T[**-X>(s)ds] ; ^ 3^ 

where, for < t < M , 

g{t)=U+ l2l °f 2) ) I |«|(e*l u l-l)i?(d«) + | / M 3 (c tM - l)v{du) 
and where T is such that for all t <T , tg(t) < 1/2. 

Inequality (1.3) does recover Talagrand's inequality for the Euclidean norm (up to 
the value of the constants). Indeed, for the symmetric exponential law, in which case 
i>(du)/du = e _ '"'/|u|,u £ K,w =/= 0, we obtain T ~ 0.06. Moreover, since g(0) = 0, there 
exists a C > such that J* 2g(s)ds < Ct 2 for all t < T. Taking t = x/{2C) for x < 
2CT and t = T otherwise, we get bounds of the form exp(— x 2 /(4C)) for x < 2CT and 
Kexp(-Tx) for a; > 2CT. 

We next obtain a result for general £ p -norms, p > 2, but under some assumptions on 
the law of X. 

Theorem 2. Let X be as in Theorem 1 and let X\ be either symmetric or non-negative. 
Let 2 <p< 00. Then, for all < x < h p (M~), 

P(||X|| p -E||X|| p >x)<exp(-^ h-\s)d S y (1.4) 

where the (dimension- free) function h p is given by 

M(e t|u| -l)i>(du), 

where < t < M and where, for any q > 1, m q =E[|Xi| 9 ]. 

The hypotheses on X may seem restrictive, but we shall see that a similar result (see 
Theorem 5 in Section 4) holds under far more general conditions. However, the dimension- 
free bound we obtain in that general framework is more complicated to express. 

Note that Theorem 2 also recovers a bound of the form exp(— min(cx 2 , c'x)), as in Tala- 
grand's result. Moreover, the constant c' is now asymptotically optimal. More precisely, 
suppose that X is infinitely divisible (without Gaussian component), one-dimensional 
and satisfies 



h P (t)=p 2 



41/PI 



i/p 



2 2p+l 



*2p 



logP(|X|>x)--A a; 



(1.5) 
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as x — ► oo, for some constant Ao > 0. Then, for every A < Ao, 

f M(e A l u l -l)p(du)<oo 
Jm 

and, therefore, ft." 1 is well defined on [0,Ao). It follows that if we study the £ p -norm of 
(X\, . . . ,X„), where the Xi are i.i.d. and have the same law as X, Theorem 2 gives, for 
every e > 0, a bound of order exp(— (Ao — s)x) for large x. In view of (1.5), we see that 
this bound is optimal, up to some subcxponential factor. 

For instance, suppose that v is concentrated on R+, has a density k and there exist 
two constants Ao, q > such that, as x — > oo, 

k(x) -x~ q c- XaX , 

where, as usual, x indicates that the ratio of the two quantities is bounded, above and 
below, as r-> oo. Then, if 2 < p < q/2, Theorem 2 gives, for large enough x, a bound of 
the form 

P(||X|| P - Ej|Xj| p > x) < c{x)c- XoX , 

where logc(x) / log a; — > as x — > oo. On the other hand, if p > sup(2, q/2), then for every 
A < Ao, if x is large enough. 

¥(\\X\\ P - E||X|| P > x) < c(x)c- Xx . 

Moreover, if the Levy measure v has bounded support, Theorem 2 gives a bound of 
order exp(— xlogx) for large x. This is known to be the right order of magnitude for a 
Poisson random variable (see, e.g., [6]). In turn, this kind of bound entails the existence 
of more-than-exponential moments. The most precise result we obtain is the following 
dimension-free extension of the results of [6, 11]. 

Theorem 3. Let X be as in Theorem 1, let v have bounded support and let R = 
inf{p: v(\x\ > p) = 0}. Then, 

E [ e (||JC|| a /fl)lQgCA||X|| a /fl) ]<+00j 

for all X such that XV' 2 / R 2 < 1/e, where V 2 = 8 J* R |u| 2 i>(du). 

Further results of a similar flavor, dealing with projections, £ p -norms or integrals with 
respect to a Poisson process, are given in the remainder of this paper. All these results 
are based on a covariance formula that can be derived from a result in [5] . This formula, 
together with its first applications, is proved in Section 2. Theorem 1 is then proved 
in Section 3. In Section 4, we state and prove Theorem 5, which is a generalization of 
Theorem 2. The last section is devoted to the proof of Theorem 3. 
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2. The covariance formula and its first applications 
2.1. The covariance formula 

The result at the root of every proof in this paper is the following one. 

Proposition 1. Let X = {X\,...,X^) ~ ID(^y, 0,v) have independent components and 
be such that Ee^ x ^ 2 < +oo for some t>0. Let f :R d -> R be such that Ef(X) = and 
let there exist b k €E K, k ~ 1, . . . , d, such that \f(x + ue k ) ~ f{ x )\ < bk\u\ for all u 6 R, 
x^M. d . Let M = sup{i>0:Vfc = l,...,d,Ee tbfc l Xfc l <+oo}. Then, for allO<t<M, 



where the expectation E z is with respect to the ID vector (U,V) in M. 2d of parameter 
(7,7) and with Levy measure zv\ + (1 — z)v§, 0< z< 1. The measure vq is given by 



while v\ is the measure v supported on the main diagonal of M. . 

An important feature of this proposition is the fact that the first marginal of (U, V) 
is X and so is its second marginal. Therefore, the main problem in estimating the right- 
hand side of the inequality in Proposition 1 will be to decouple U and V, that is, to split 
the product \f{U + ueu) — /(?7)| 2 e'^ v ) without changing the term e l ^ v \ To do so, a 
first attempt could be to use a supremum. 

Corollary 1. Let X = (X\, . . . ,Xd) ~ ID (jy,0,u) have independent components and be 
such that Ee*H x l |2 < +00 for some t>0. Let f : R d — > R and let there exist b k eR : k = 
1, . . . , d, such that \f(x + ue^ 

) - f{x)\ < b k \u\ for all ueM, x G R d . Let 





vo(du, dv) = v(du)5o(dv) + 5o(du)v(dv), 



u,v€R d , 




where M = sup{t > : Vfc = 1, . . . , d,Ee tb "\ Xk 



< +00}. Then, 



F(f(X)-Ef(X)>x)<e Jo 



(2.1) 



for allOKxKh^iM-). 
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Proof of Proposition 1 and Corollary 1. Below, and throughout, by "/ Lipschitz 
with constant a" we mean that \f(x) — f(y)\ < a\\x — y\\ for all x, y G R d (the Lipschitz 
convention stated in [6] also applies). Let us start by recalling the following simple lemma 
which will be crucial to our approach [5] (or [6] for a sketch of proof). The lemma is the 
infinitely divisible version of the covariance representation for functions of Gaussian vec- 
tors obtained via Gaussian interpolation. Its proof is also obtained via infinitely divisible 
interpolation and, below, the law of the vector ([/, V) is as in the previous proposition. 

Lemma 1. Let X ~ ID(-f,0, v) be such that E\\X\\l < +oo. Let f,g:R d ^R be Lipschitz 
functions. Then, 



Ef(X)g(X) - Ef(X)Eg(X) 



= / E, 



(W + u) - f(U))(g(V + u) - g{V))v(du) 

i 

where E z is as in Proposition 1. 

We then follow [6]. First, by independence, 

C = {t>0:Vfc=l,...,d,Ee' bfc|Xfc| < +00} 



(2.2) 



t >0:Vfc= 1, 



i>k(du) < +00 



Next, we apply the covariance representation (2.2) to / satisfying the above hypotheses 
and moreover assumed to be bounded and such that Ef = 0. Hence, 



Efe tf = J o 



< 



V / (f(U + ue k ) - /([/))(e*(/^+« e *)-/W) - l)i?fc(d«) 
(\ z c^J2 [ \f(U + ue k )-f(U)\ 

JO u—1 Js, 



dz 



x \f(V+ue k )-f(V)\ 



gtbfcM _ \ 

h\u\ 



v k (du) 



dz 



< 



E, 



,tf(v) 



E 

/c=i 



|/(17 + ue k ) - f(U)\ 2 + \f(V + ue k ) - f(V)f 



Q tb k \u\ _ 2 

b k \u\ 



v k (du) 



dz, 



which gives Proposition 1. For Corollary 1, we continue. 

E/e 4/ <h f (t)E[e tf ], 
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where we have used the "marginal property" mentioned above and the fact that h / (i) is 
well defined for < t < M. Integrating this last inequality, applied to / — E/, leads to 

]&.*(/-*/) < «>£ ^ (s)ds , < t < M, (2.3) 

for all bounded / satisfying the hypotheses of the theorem. Fatou's lemma allows us to 
remove the boundedness assumption in (2.3). 

To obtain the tail inequality (2.1), the Bienayme-Chebyshev inequality gives 

¥(f(X)-Ef(X)>x)<exp(- sup (tx- [ h f (s)dsX) = e ~ ^ h f 1( - s)ds 

\ 0<t<A/\ JO // 

by standard arguments (see, e.g., [6]). □ 



2.2. First applications 

In general, the above corollary does not provide dimension-free results, even if it slightly 
improves a result of [6]. However, for particular functions, the above formula can, in fact, 
be quite efficient. As a consequence of the previous corollary, we present some almost 
dimension-free results. First, we have the following. 

Theorem 4. Let X be as in Theorem 1. Let e > 0. Then, for all < x < h{M~), 

P(||*||a > (1 + e)E||X|| a + x) < e'So h " (s)ds , (2.4) 
where the (dimension-free) function h is given by 

h(t) = 8 [ M(e*l"l-l)g(d«)+ I | W | 3 (e*N-l)5(dti). 

Theorem 4 still has some weak dimension dependency via the term eE||X||2 (the ex- 
pectation and the median playing the same role up to some constant). In particular, 
it does not precisely recover Talagrand's result, even for the Euclidean norm. However, 
the function h itself is dimension-free, in that it can be both upper and lower bounded 
independently of the dimension d since for X = (X±, . . . , Xd), dmini = i.... i( ;(E|Xi|) 2 < 
(E||X|| 2 ) 2 <dmax l=1 ,..., d E(X2). 

The advantage of Theorem 4 is that it does not require any additional assumptions, in 
contrast to Theorem 2, and that it recovers the xlogx-type bound when v has bounded 
support, which is not the case of Theorem 1. We refer the reader to [7], whose results are 
sometimes superseded by the present paper, for various applications of Theorem 4. 

Actually, the mild dimension dependency in Theorem 4 is not much of a problem in 
the statistical applications we have in mind. However, a statistician would prefer not 
to have any unnecessary extra assumptions on the variables themselves. Let us explain 
these comments by means of an example. 
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Assume that we observe n Poisson processes on [0, 1] with intensity s with respect to 
the Lebesgue measure. We would like to estimate the function s. A simple way to do it 
is to discretize the problem. So, let d be some integer (usually smaller than y/n if s is 
regular enough) and for all i, 1 < i < d, let iVj be the total number of points that have 
appeared between (i — l)/d and i/d. Therefore, the variables iVj are independent and iVj 
obeys a Poisson law with parameter Si — j l ^ 1 y d s(x)ndx~ (n/d)s(i/d). If we want to 
understand the behavior of the estimator N = (N\ , . . . , Nd) around S = (Si , . . . , Sd) , we 
need to control ||e|| 2 , where we write, for all i, 

Ni = Si + ei. 

We deal here with a regression problem where the noise e= (ex, . . . , e n ) has independent 
components, but these components are not identically distributed. More generally, we 
would like to encompass the case where the noise is centered with independent infinitely 
divisible components. Classical regression corresponds to a Gaussian i.i.d. noise and we 
refer the interested reader to [8] for an extensive study of the link between concentration 
and estimation of the signal S through model selection methods in that framework. 

We would not only like to drop the i.i.d. assumption in Theorem 4 (which is rather 
easy to do), but we would also like to have an inequality which is valid under a very 
mild assumption on the noise. The next corollary only assumes that there exists a known 
bound on the support of the Levy measures, which is, for instance, the case for the 
Poisson problem we described above. 

Corollary 2. Let X <~ ZD(7,0, v) have independent components and be such that = 
inf{/9 > 0, ^-(|a;| > p) = 0} is finite, with R = maxi<k<d Rk ■ Let e > and let 

2 d f 

2 " fc(dM) + ( £ E||xn^ g i y " 4 " fc(dM) - 

Then, for all x > 0, 

F(||X|| 2 > (1 + e)E||X|| 2 + x) < c -/fl-(-/fl+K 2 /fl 2 )iog(i+^/K 2 ). 



V, = 8 max 

Kk<d 



The above result improves upon known concentration inequalities for Poisson processes. 
One can easily prove that in the framework mentioned above with X = N — S , the factor 
appearing in Corollary 2 is of the order 8nB/d, where B is an upper bound on s, as 
soon as s is bounded from below and d << \fn. So, applying Corollary 2 to TV — S, we 
obtain that there exists a constant C > such that for all x > 0, 

P(||JV - 5|| 2 > (1 + e)E||JV - 5|| 2 + X) < c -C^^ 2 d/Bn, x \ ofi (xd/Bn)) 

Applying instead Theorem 4 of [10] to our problem, we see that there exists a constant 
C > such that for all x > 0, 

P(||iV- S|| 2 > (l + e)E\\N- S\\ 2 + x) < e -C ™™{x 2 d/Bn,x) ^ 
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Thus, even in this simplest case (of a Poisson process), we see that Corollary 2 (optimally) 
improves known results by a logarithmic factor. 

However, if one is interested in recovering the function s from the observations of 
the n Poisson processes and if s is not very smooth and varies greatly on a very small 
interval, looking at a regular partition might be rather useless. It might, instead, be much 
more fruitful to look at the function s discretized on very small intervals (d = n) and 
then to look at the projection of the signal S on a space S which is generated by, say, 
a few Haar wavelets. Now, if the signal S is sparse on the Haar basis, it means that 
one can find (and use) a space iS, with a dimension much smaller than n, to provide 
a good approximation for S. However, one now needs to understand the behavior of 
1 1 lis (./V — S)\\s, where Hs is the orthogonal projection on S. As before, we want to deal 
with more general infinitely divisible noise than simply centered Poisson variables. The 
two following corollaries provide such results. 

Corollary 3. Let X ~ ID(^,Q,v) have independent components. Let S be a subspace of 
M. d and 11$ the orthogonal projection on S. Let M = sup{i > 0:V/s= 1, ... ,d, Ee*' Xfc ' < 
+oo}. Let E>0. Then, for allO<x< h(M~), 

P(||IIs(X)|| 2 > E||IIs(X)|| 2 + E + x) < eSo h " (s)ds (2.5) 

and 

P(||nsX|| 2 < EHlIspOHa -E-x)< e-/o" h_1 « d ', (2.6) 
where the function h is given by 

Mi) =8 max f \u\(e^ - l)Pfc(d«) + ^ ||Hs(e fc )||| f \u\ 3 (e^ - l)P k (du) 

forO<t<M. 

The next version, which assumes i.i.d. coordinates, in contrast to the above, can some- 
times be easier to use. For the statistician, the i.i.d. case appears when dealing with a 
regression problem where the noise does not depend on the signal itself. 

Corollary 4. Let X be as in Theorem 1. Let S be a subspace of M. d and let Us be the 
orthogonal projection on S. Let e > 0. Then, for all < x < h(M~), 

P(||IIs(X)|| 2 > (l + £ )y / E||Hs(X)||2 + x)<c-X>" {s)ds (2.7) 

and 

P(||ns(X)|| 2 < E||nspQ|| 2 - ey/E\\Us{X)\\l-x) < eSo h "^ d \ ( 2 .8) 
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where the (dimension-free) function h is given by 

fc(f) = 8 / | u |(e t W-l)i/(d«) + -5| ?5 / \u\ 3 (e^-mdu) 

JR e M l JR 

forO<t<M. 

Finally, in the density framework [4], it is known that the Euclidean norm does not 
suffice to assess the performance of one estimator: if the density s belongs to a Sobolev 
space H a , linear estimators cannot achieve the optimal rate of convergence for the L p - 
norm if p > 2, whereas they can for the L2-norm. This corresponds to sparse signals S 
that can be approximated by their projection on a subspace with small dimension with 
respect to n but for the L p -norm. So, in that context, it is necessary to work with general 
Lp-norms and concentration results for £ p -norms are more relevant than for the ^2-norm. 
Such results are presented in the following corollary with a weak dimension dependency 
and in Theorem 5 without any dimension dependency and with as few assumptions as 
possible on the noise (see Section 4). These results are given in their i.i.d. version for the 
sake of simplicity, but one can easily see their non-i.i.d. version from the given proofs. 

Corollary 5. Let X be as in Theorem 1. Let p > 2 and e > 0. Then, for all < x < 

KM-), ' 

P(||X|| P > (1 + s)n\\X\\v) +*)< e - £ "" (s)ds (2-9) 

and 

H\\X\\p < (1 - e)E(ll*l|p) - x) < e-i> _1 « ds , (2.10) 
where the function h is given by 

for0<t< M. 



2.3. Proofs 

Let us proceed to the proof of the results of the previous subsection. We begin with 
Corollary 3, the other proofs being easier. 

Proof of Corollary 3. We apply Corollary 1 to f(x) = (||IIs(x)||2 — E) + . First, it is 
easily verified that for each k, \f(x + uek) - f{x)\ < |||n s (x + ue fe )j| 2 - j|n s (2:)||2|lA fc , 
where A k = {\\IL s (x + ue k )\\ 2 > E or ||n s (.T)|| 2 > E}. We then have 

| / (, + Mefc) -/(.,)|<^ ns(efc) ' nsW)+u2!|ns(efc) ^ 11 ^ 



|n s (a; + uefc)||2 + ||ns(a;)||2 
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2M|(n 5 ( efc ),n s (x))| u 2 nn 5 ( efc )||j 

\\n s (x)\\ 2 + E 

Moreover, since \f(x + uek) — f(x) \ < \u\, we have 

^ e tb k \u\ _ ^ 



f p tb k \u\ _ i 

E yj/(* + «e fc ) - /(x)i 2 - ^ ^ fc (d M ) 



< V f f 2 \(U s (e k ),U s (x))\ 2 2^||n 5 (e fc )||| \ / e«H-l \ 



k=l 



^2. L( 8u i|n s( x)||i + & 



k=l 



Hence, h f < h. To complete the proof of (2.5), note that ||n s (X)|| 2 -E< (||n s (X)|| 2 - 
E)+ and that E(||n s (X)|| 2 - E)+ < E||n s (X)|| 2 . To get the lower bound (2.6), just 
proceed as above, but with the function f(x) = — (j|ns(ir)|| 2 — E) + and note that 
(||n s (X)|| 2 - E)+ < ||n s (X)|| 2 and that E||n s (X)|| 2 -E< E(||n s (X)|| 2 - E)+. □ 

Proof of Theorem 4. We apply Corollary 3 with S = R d and E = eE\\X\\ 2 . □ 

Proof of Corollary 2. It is sufficient to note that, proceeding as in Corollary 3, h(t) < 
ho(t) = V^(e xR — l/R) and that M = +oo. It remains to integrate the reciprocal of ho. □ 

Proof of Corollary 4. Again applying Corollary 3, let us take E = e^/WJ^RgiX)^). 
Then, note that in the centered i.i.d. case, 



- d / d \ 2 "i 

E[||n s (x)|| 2 ] = E ]T 5> fc (n s (e fc ), ei 
.1=1 \fc=i 

d d 

= ^^E[X fc 2 ](n s (e fc ), ei ) : 
i=i k=i 

= E[x 1 2 ]^||n s ( efe )|| 2 



k=l 



>E[x 1 2 ]^||n s ( efe )||^ 



k=l 



since ||n s (e fc )|| 2 < ||e fc || 2 = l. □ 

Proof of Corollary 5. We apply Corollary 1 to f(x) = (\\x\\ p - eE(\\X\\ p ))+ to get the 
first result. For the second one, it is sufficient to do the same with — (||.t|| p — eE(j|X|| p )) + . 
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The end of the proof is as in Corollary 3. Let x be in M. d . First, it is easily verified that for 
each k, \f(x+ue k )-f(x)\ < |||x+uefc|| p - ||a;||p|lA fc , where A k = {\\x+ue k \\ p > eE(||X|| p ) 
or ||a;|| p >eE(||X|| p )}. Since 

we have 

l /( . + ^)- /W i< Ja±g^£L_ w (2 . 13 , 

But since x t—> x p is convex, we have 

\\x k +u\P-\x k \P\<\(\x k \ + \u\)P-\x k \P\. 
Combining this with the fact that 

Vj/>0 (l + yY-lKpyil + yf- 1 

implies that 



\f(x + ue k ) - /(,)| < mMpAx + Wkh y-^ 
< p\u\(\x k \ + \u\f- 1 



sup(||x|| p ,e]E(||X|| p ))p-i' 
Moreover, since \ f(x + ue k ) — f(x)\ < \u\, we have 

a tbk\u\ _ i 



(2.14) 



<P 2 I JT^]S^-> \(- tH - DP(d«) (2-15) 



E / \f(x+ue k )-f(x)\ 2 D k (du) 

\\\x\ + \m%zi 

sup(\\x\\ p ,eE(\\X\\ p ))^ 

7 K Vsup(||a;j| p ,£E(||X|| p )); 

where / = (1, . . . , 1) g R d and \x\ = (\xi\, . . . , |x d |). Since p > 2, 2p- 2 >p and ||x|| 2p -2 < 
||x|| p , which implies that 

£ / l/(^ + ^ fc )-/(x)[ 2 e ~ 9 k (du) 

f / dV(2p-2) \ 2 P~2 
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Here, again, the upper bound is dimension-free since E(||X|| p ) > E(|Xl|)g£ 1 ' p . □ 

3. Proof of Theorem 1 

Another method to decouple U and V in Proposition 1 is to use the following inequality, 
which is a particular instance of Young's inequality (for the pairs of conjugate functions 
ce x and ylog(y/c) — y, with the optimal c) and has already been used in [8]. 

Lemma 2. Let A > and let X and Y be random variables for which all the expectations 
below exist. Then, 

E[Xe XY ] < E[Yc XY ] + l0gE J cA ' Y] E[c Ay ] - ^S E [ e ^]. ( 3 .1) 

A A 

Proof. Indeed, if 

dQ- 



E[e XY ] ' 
then, by Jensen's inequality, 

XE Q (X-Y) <logE Q (c A(x - y) ). □ 

With the help of the previous lemma, the following holds. 

Corollary 6. Let X = (X\, . . . ,Xd) ~ ZD(7,0, v) have i.i.d. components and be such 
that Ee* l|x|12 < +oo for some t>0. Let f : R d -> R be such that Ef(X) = and let there 
exist b € R such that for all k, \f(x + ueu) — f(%)\ < b\u\ for all u £R, x S M. d . Assume, 
moreover, that for all agK, there exists a function C u such that 

d „ 

/ \f(X + ue k )- f(X)\ 2 v k (Au)<u 2 C u {X) 
fc=i^ R 

and such that E[e x ^ u,t ' Cu ^ x ^\ < oo for X(u,t) > 0. Then, for all t for which all the quan- 
tities below are well defined, we have 

(l-h(t))E[fe^]<g(t)E[e^}, (3.2) 

where 

r f P *% l _ i 

h & = / ir^H — t—^ Au ) 

J R X{u,t) b 



and 



. ln(<t>(u,t)) . ,e* b l u l -I.,, . 
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and where <p{u,t) = E[e A ( u '*) c «W]. 

Proof. Applying Proposition 1 to /, the above assumptions entail that 
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Efe tf < / E 2 
Jo 



dz. 



Next, apply Lemma 2 to \{u,t)Y = tf(V) and to X = C U (U) or! = C U (V). Since Y 
has zero mean, one can ignore the last term in (3.1), and this leads to 



E/e 4/ < 



E 



\(u,t) 



f ) X(u t) (C ^ 6 ^ ^ 



which concludes the proof. 



□ 



(3.2) is non-trivial only when h(t) < 1. One of its applications is to prove Theorem 1, 
with the help of our next two results. 

Lemma 3. For a > 0. let i a = - \\\¥\cT aX i\. Then, for all A > 0, v > such that, l a > 
X/v, 



E( 



Xd 



y cxp y\\m+v 



< 1 + exp 



vA 



- X/v J ' 



Proof. Let £ > 0, to be chosen later. Let a = cx -P( EdE ('x : i) +v ) an d let b = exp(^). Then, 

Xd 



E exp 



\X\\l + v 



edE{X'f)+v 
Xd 



exp 



x i + v 



< a- 





df 


Ad 








hr£ 





<a+ / E[e- QX i] d e" a, ' +QAd/ln *d< 



< a + e-^ +QedE ^ 2 '(6-a) 
< a + e d ( £QE ( x i 2 )+( A /f 



Taking e such that eaE(Xf) + (X/v) — £ a — leads to 
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Lemma 4. There exists positive constants ci, C2, C3 such that for all x £ M. d and mgM, 

Proof. The proof is similar to an argument used in the proof of Corollary 3. We have 

k=l k=l 

But, for all e > 0, 



\x + uek 2 + Mb 



|x + ue fc ||| = ^x 2 + (a* + u) 2 = IMI2 + 2ux fc + u 2 > \\x\\\ - sx 2 k + (I - e x ) 



u 2 . 



Therefore, 

(||x + ue fc || 2 + ||a;|| 2 ) 2 > \\x + ue k \\ 2 + \\x\\ 2 > (2-e)||x|| 2 + (1 -e^u 2 . 

Taking e = 3/2, completes the proof. □ 

With the help of the previous lemmas, we now get the following. 

Proof of Theorem 1. We want to apply Corollary 6 to f(X) = \\X\\ 2 - E\\X\\ 2 . With 
the notation of Lemma 4, 

works. We must then compute ln(</>(u, i)). But, 

ba.(<j)(u,t)) = a\(u,t) +ln(E exp( ^ ' ^ 2 — - 

V L \ll A lb + c 3 u 

and so from Lemma 3, it follows that 

aX(u, t)c 2 u 



bx((j>(u,t)) < ci\(u,t) + lnl 1 + exp 

for all a such that £ Q > X{u 1 t)c 2 / c 3 . 

Taking a = 1 and A(u, £) = 03^/(202) gives 



£ Q - X(u,t)c 2 /c 3 



ln((/>(u,t)) < ciX(u,t)+ln2 + c 3 u 2 , 
which leads to the result by standard arguments. □ 
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4. Proof of Theorem 2 
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We state and prove in this section a generalization of Theorem 2. First, recall that the 
vector X can be viewed as the value at time 1 of a Levy process (X z , z > 0). For every 
z G [0, 1], decompose X = Y z + Z z , where Y z = X z and Z z = X — X z , so that Y Zl Z z 
are independent. If 1 < k < d, we write (Yk) z , (Zk) z f° r the fcth coordinate of Y Z ,Z Z . 
Furthermore, to simplify the notation, we denote 

Y t + = (Y 1 ) t l { Y l)t > (4.1) 

and likewise define Yf , Zf , Z t ~ . 



Theorem 5. Let X be as in Theorem 1 and let 2 < p < oo. Then, for all < x < 
h P (M-), 



¥{\\X\\ p -E\\X\\ p >x)<exp(- J* tftfds^, 



(4.2) 



where the (dimension-free) function h p is given as follows: 
• if X has almost surely non-negative coordinates, 



h P {t)=p 2 



1 



2p-2 



■4 1 



1 



2 i/p 



2p-2 



m 2 p 



u(e tu -l)v{du), (4.3) 



where the moments m2p,m p are defined by m q =E[X*] for q=p,2p; 
in the general case, 



h p {t)=p 2 



2 2p m 2p 
rn 2 



\u\(e tlul - l)P(d«), (4.4) 



where the modified moments mi p ,m p are defined, using the notation given in (4-.1), 
by 



and 



m p = inf [inf{E[|y+ + Z+|P],E[|y- + Z~\"]}] 

z(E[Q.l\ 



m^= sup [snp{E[\Y+ + Zt\ 2p }MY z -+Z;\^]}}. 
ze[o,i] 



When 1 < p < 2, an inequality similar to (4-2) holds, where h p is now replaced by the 
following function h p ,d (which is no longer dimension-free) : 
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Observe that Theorem 2 is a particular case of this more general result. The obvious 
drawback of Theorem 5 is that, except in the cases considered in Theorem 2, we do not 
have a precise control of the quantities m p , m,2 V - In particular, if m p = 0, then h p = oo 
and we get a trivial bound. However, it should be clear that when x does not have almost 
surely positive coordinates, the quantity 



inf lux 

ze[s,i 



f{E[|y; 



\Y~ 



z;\ p }}} 



is positive for every e > 0. So, the only case when m p might be zero is the case when 
(Xi) t , the first coordinate of the Levy process X±, taken at time t, has a probability 
tending to 1 or to to be positive as t — > 0. This kind of behavior does exist, but in 
most 'natural' examples, this is not the case and then Theorem 5 does give a non-trivial 
bound, although the expression of this bound is not always easy to handle. 

Proof of Theorem 5. The proof can be divided into three steps. Define the function 
f(V) = \\V\\ p -E\\X\\ p . 



Step 1: We claim that 

\f(X + ue k )-f(X)\<p\u\ 



. (11*11? + M") 1 /* 

Step 2: Using the notation of Proposition 1, we have 

d 



p-i 



(4.5) 



A:=E Z 



B:=E Z 



£l/(^ + «efc)-/(*0l 2 e t/(V) 



< 



V 



\u\ 2 F{ P ,d,u)E[e tf( - v) ] 



,fe=i 



<p 2 \u\ 2 F(p,d,u)E[e tf ^} 



for some function F{p, d, u) that will be made explicit in the course of the proof. 
Step 3: For h p and h p ,d as in Theorem 5, we have, if p > 2, 



and if 1 <p< 2, 



E(/e 4/ ) < h p (t)E(e tf ) 



E(fe tf )<h p4 (t)E(c^). 



Integrating the inequalities of this last step leads to Theorem 5. 
Proof of Step 1. First, for all reals a,b> 0, 



max(a, b) 1 ^ — min(a, b) 



l/p _ 



\a-b\ 



i/pr 



l/p 



\a-b\ 



a 



\a-b\ 



2 

1/p 



\a-b\ 
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Since the function x i— ► (1 + x) 1 ^ — (1 — x) 1 ^ is convex on [0, 1] , is zero at zero and 2 1 / p 
at 1, we get 

M/p^ f a + b Y /P foi/p\ a - b \\ _ \ a - b \ (4g) 



(a + fe)^- 1 )/?' 



Next, we want to apply (4.6) to a = b = \\X + ue k \\P. Put Ck = u/X k and distinguish 

between three cases. First, if Ck > 0, then 

and 

\X k + u\f + \X k \P>2\X k \P + \u\P. 
Second, if —2 < c k < 0, then 

iiXfc+^-i^ri^Mi^-r 1 

and we can check that for every A > 0, 

|-X: fc | < (l + |c fe |)|X fc | 



(A + \x k + u\p + \x k \py/p - (A + \x k \p + \ Ck \p\x k \py/p- 

Third, if c k < -2, then 

WXk+ulP-lXklPl^pluWXk + ulP-^plulUl + CkWX^P- 1 
and we can check that for every A > 0, 

|l + c fe ||X fc | < (l + \c k \)\X k \ 



{A + \x k + u \p + \x k \pf/p - (A + \x k \p + \ Ck \p\x k \py/p- 

Combining the inequalities in these three cases with (4.6), we obtain (4.5). 
Proof of Step 2. We will use the following well-known result. 

Lemma 5. Let T be a random vector in M. d with i.i.d. components and let A,B:l 
be two functions. For every i <d and every x E R d , define the functions A X: i, B x 



A Xji (t) = A(xi,. . .,Xi- U t,x i+ i, . . .,x d ), 
B x ,i(t) — (xi, . . .,Xi-i,t,Xi+i, ...,Xd). 

Assume that for every i < d and every x € one of the two functions A x .i,B X: i is 

non- decreasing and the other one is non-increasing. Then 

E[A(T)B(T)] < E[A(T)}E[B(T)} . (4.7) 
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Proof. When d = 1, the proof is obtained by writing 

E[A(T)B(T)} - E[A(T)]E[B(T)] = ^~E[(A(T) - A(T')){B(T) - B(T'))], 

where T' is an independent copy of T, and using the monotonicity assumptions. When 
d > 1 , we proceed by induction. □ 

Let us first bound A. Summing (4.5) over k, the triangle inequality for || • || 2p— 2 gives 

- \\Vh P -2 + \u\dWr-Q \ 2p - 2 ctf(v) 



A<p z \u\ 



(\\v\\ p p + H p ) 1/p 



T£2p-2>p, \\V\\ 2 p-2 < \\V\\ P and, otherwise, \\V\\ 2 p-2 < ||^|| p d 1/(2p - 2) - 1/p . Conse- 
quently, 



2l,|2t, 



A<p z \u 



Qp 



\ | U |d 1 /(2p-2) 



where the polynomials Q p are defined as follows: 

• ifp>2, Q p :x^(l + l/x) 2p - 2 ; 

• if p<2, Q p :x^ (d 1 /(2p-2)-l/p + 1 / a .)2p-2^ 

We can apply (4.7) to T = \\V\\ P since Q p { | lt |Ji /(2p _ 2) ) is decreasing in \\V\\ pi while 



otf(V) 



is increasing. This gives 



2i„.|2n 



A<p z \u 



f (\\v\\ p p + H p ) 1/p 

\ | U |d 1 /(2p-2) 



E[e'^)]. 



Next, let us bound B. Summing (4.5) over k gives B <p 2 \u\ 2 T> with 



V = E- 



V |«|d 1 /(2 P -2) 



Note that the distribution P z of (J7, V) is such that for all z e [0,1], t/ = Y z ' + Z z , 
while V = Y z + Z z , where Y" z , Y^' and Z z are independent and Y z and Y z ' are identically 
distributed. Hence, 



P = E 2 



p\l/p \ 



,*/(n+z*) 



| U |d 1 /(2p-2) 

Let us introduce the function M p : for all real u, y,y',z in R d , 



M p (M,y,2/,z) = 



/ y "'■sgn(t/ i ) = sgn(i/J)=sgn(z i ) l^j + Hi \ P 



1 VP 
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Then, M p (\u\,y 7 y' , z) < (\\y' + z\\p + \u\ p ) 1 ' p and since Q p is decreasing, 

T) < jff Q | M p(\ U \> Y zi Y z, Z z) \„tf(Y z + Z z ) 
| U |dV(2p-2) 
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(4.8) 



From now on, we split E z into E,Y zt Y'^-z z , meaning that we first integrate according to 
the law of Z z . Let us fix Y z and Y' z . Suppose, first, that (Zk) z , (Yk)z and (Yl) z have 

the same sign. Then, one of the two functions Q p ( Mp | [j^ji/^'J^f * ' ) ; e t ^ Y " +z ^ is non- 
decreasing in (Zk) z , while the other function is non-increasing in (Zk) z . Next, if two of 
the three reals (Zk) z , {Xk)z and {Yl) z have a different sign, then the function M p is 
constant in (Zk) z - Hence, we can use (4.7) and if we denote by / the set of indices for 
which (Zk) z is positive, we get 



lQ{l,...,d} 
IC{l,...,d} 



( M P (\u\,Y z X,Z*) \tf<Y z+ z.u 
M p (\u\,Y z X,Z z ) 



(z k ) z >o iff {fee/} 



Qp 



|«|di/(ap-2) 



| W |d 1 /(2p-2) 

Integrating in Y' z and then in Y z leads to 



E z= [e 



t/(Y*+Z,)l 



t/(y.+^)i 



{(z fc ),>o iff fce/}J 



X> < sup Ey/ z 2 



M p (|uU,rj,^) 

| U |d 1 /(2p-2) 



where there is no index z in the last expectation since all the marginals are the same. In 
this way, we have proven the second step, with 



F(p, d, u) = sup Ey j 2 z 



Q, 



M p (\u\,y,Y<,Z z ) 
|u|d 1 /(2 P -2) 



Proof of Step 3. Since Q p is continuous and decreasing, let us denote by V p its 
reciprocal. In order to bound F(p,d,u), we start by evaluating the probability 



Y'Z Z 



M p (\u\,yX,Z*) 

| U |d 1 /(2p-2) 

M p (\u\,yX,Z z ) 



> s 



| u |dl/(2p-2) 



< VJs) 



which is, in fact, zero if s > a = Q p (l/d 



l/(2p-2) 



)• 



Suppose that y has A: positive coordinates and d — k negative coordinates. Let J+ be 
the set of i such that yi>0 and let I_ be the set of z such that t/j < 0. Set 

m+(z)=W\Y+ + Z+n 
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m q (z) = sup(m+(z), m q (z)), 
and define m~(z) and m q (z) likewise, using Y~ and Z~ . Then, 
'M p (\u\,yX,Z z y 



Y!,Z Z 



> S 



,z. 



Y'Z % 



\ u \ d l/(2p-2) 

j2 \y*+z+\*+ x; \Yr+z-f<dp/^\u\pv p ( S r- 



iei- 



\ Y z + + Zt\ p + J2 \ Y *~~ + Z -\ V - d p/{2p - 2) \u\ p V p (s) p 



iGI- 



Now, if s is such that d p ^ 2p 2 ^ \u\ p V p (s) p < dm p (z)/2, which is equivalent to saying that 

- d (p_2)/(2p-2) mp( ypl/ PN 



s>b = Q p 



then 



Y'Z, 



M p (\u\,yX,Z z ) 

| U |d 1 /(2p-2) 



2\u\p 



> S 



< 



< 



Y'Z, 



Y'Z. 



Y,\Y^ + +zf\ p +j2\Yr+z-\ p < 



dm p {z) 



iei-ir 



iei- 



Y,\^ + + zt\ p + Y,\ Y r + z;\ p 

dm p (z) 



iei. 



iei- 



< krrip(z) + (d— k)m p (z) 



Using the Bienayme-Chebyshev inequality, we obtain 



Y1,Z, 



M p (\u\,yX,Z z ) 

| u |dI/(8p-2) 



> s 



<4 



kmt p {z) + {d - k)m 2p {z) 



d 2 m p 2 {z) 



< 



Integrating this last probabilistic inequality gives 

'M p {\ulyX,Z z ) 



E 



Y'Z. 



Q, 



| M |d 1 /(2p-2) 



4m 2p (z) 
dm p 2 (z) 



a J m p (z) 



(4.9) 
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Recalling that a = Q p (l/ d 1 ^ 2 ^), so that a/d < 2 2p - 2 , this entails 

F(p,d,u)<b + 2^^\. (4.10) 

Ulp \ZJ 

Then, using Proposition 1, where all the bk are here equal to 1, together with Step 2 and 
(4.10), we get Step 3. 

The positive case. When X has almost surely non-negative coordinates, the following 
improvements are possible. First, (4.5) can be replaced by 

\ X A , U.I \H 



|/(X + ,e fc) -/(Z)|<p|,l^ (2||x £ +| ; | ; )1/p j . (4.11) 

Indeed, we can use the proof of (4.5), but here we only have to consider the case Ck > 0. 

Second, if X has non-negative coordinates, we can apply (4.7) directly, without intro- 
ducing M p . In this way, (4.10) can be replaced by 

F(d, P ,u)<b + i(l + ¥rp ) ^J, (4.12) 
where m p and m2 P now stand for the usual moments, that is, m q = ¥\X q ] for q = p, 2p. □ 



5. Proof of Theorem 3 



Let 

x + (\x 

5(x)= i? log u 

Note that g is a bijection from [R/\, +00 [ to [0,+oo[. To prove Theorem 3, it is thus 
sufficient to determine for which A, Ec 9 "' x " 2 ' converges. Let e > and let c = max((l + 
e)E\\X\\ 2 ,R/\). Then, 

r + co r+00 

Ee g(\\xh) = p( c a(W x h'> >t)dt<c 9{c) + f>( c 9(\\xh) > t )dt. 

Setting t = e 9 ( c+u ' in the last integral and applying Corollary 2, we get 

r+00 

E e ff(IW| 2 ) < e 9(c) + / c u/R-(u/R+V s 2 /H 2 )lo S (l+Ru/V s 2 ) 

Jo 



1 1 ,/Ac+u 

— + — log + 

R R \ R 



e (c+«)/Rlog+(A(c+«)/R) du 
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For this last integral to converge, it is sufficient that the power of e u l R be negative, that 
is, that 

1 + log+ (^Vlo g fl + ^<°, 



R J b \ V 2 , 

at least for u large enough. But, if u tends to +00 in the expression above, we obtain 
that 

( XV 2 \ i? 2 e _1 
1 + lo s( -fw- ) < °) that is > A < 



\ r 2 ) ' ' v; 2 ' 

Next, we need to pass from V e to V. But, if A < R 2 c~ 1 /V 2 . then there exists e (quite 
large, of course) such that A < R 2 e~ 1 /V 2 and this concludes the proof. 
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